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ABSTRACT We have previously described a set of human 
T-lymphocyte-specific cDNA clones isolated by a modified 
differential screening procedure. Apparent full-length cDNAs 
containing the sequences of 14 of the 16 initial isolates were 
sequenced and were found to represent five different species of 
mRNA; three of the five species were identical to previously 
reported cDNA sequences of preproenkephalin, T-cell-replac- 
ing factor, and a serine esterase, respectively. The other two 
species, 4-1BB and L2G25B, were inducible sequences found in 
mRNA from both a cytolytic T-lymphocyte and a helper 
T-lymphocyte clone and were not previously described in T-cell 
mRNA; these mRNA sequences encode peptides of 256 and 92 
amino acids, respectively. Both peptides contain putative 
leader sequences. The protein encoded by 4-1BB also has a 
potential membrane anchor segment and other features also 
seen in known receptor proteins. 



Most T-cell factors have been classically identified by recog- 
nizing biologic activities in assays, purifying the protein, and 
then isolating the corresponding genes using bioactivities and 
protein information. An alternative approach is to isolate 
putative T-cell genes based upon specific expression and then 
to demonstrate the function of the unknown molecule (1-3). 
This laboratory has recently cloned a series of T-cell subset- 
specific cDNAs from cloned helper T-lymphocyte (HTL) L2 
cells and cloned cytolytic T-lymphocyte (CTL) L3 cells by 
employing a modified differential screening procedure (4). 
Subsequently, we isolated and sequenced apparent full-length 
cDNAs corresponding to 14 of the original clones representing 
five different species of mRNA. Here we summarize our 
studies and present the sequences of two cDNAs that do not 
correspond to previously reported T-cell mRNA species.^ 

MATERIALS AND METHODS 
Cells. We are grateful to F. Fitch of the University of 
Chicago for supplying us with the T-cell clones, L2 and L3 
cells (5). Cloned murine CTL L3 cells are Thy-1,2\ Lyt-2 + , 
LFA-1 + , L3T4", and H-2L d -reactive. Cloned murine HTL 
L2 cells are Thy-1,2 + , LFA-1\ Lyt-2", L3T4 + , and Mls a ' d - 
reactive. 

Before the isolation of RNA L2 cells were stimulated with 
Con A (10 /ig/ml) for 14 hr at a cell concentration of lOMO 7 
cells per ml. L3 cells were stimulated with Con A (2 fig/ml) 
for 14 hr at a cell concentration of 2.5 x 10 6 cells per ml. 
Mouse thymoma EL-4 cells (6) were stimulated with phorbol 
12-myristate 13-acetate (10 ng/ml) at a cell concentration of 
1.0 x 10 6 cells per ml for 20 hr; stimulation was monitored by 
interleukin-2 assay (7). RNA was isolated from the unstim- 
ulated B-cell lymphoma K46 (8) and rat natural killer cell 
large granular lymphocytes (9). 
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Isolation of T-Cell-Specific cDNA Clones. We have previ- 
ously isolated a group of cDNAs that are specific for T cells 
in contrast with B cells, employing both positive and negative 
differential screening and RNA blot analysis of various 
lymphoid cells. The T-cell-specific cDNAs were further 
studied to determine whether they were specific for cloned 
HTL L2 cells or cloned CTL L3 cells. This allowed us to 
isolate T-cell-subset transcripts that exist at a low level (4). 
The two cDNA sequences, designated as L2G25B and 
4-1BB, were isolated in this fashion and were referred to as 
L2G25#4 and L3G29#4, respectively, in an earlier report (4). 

RNA and DNA Blot Hybridization. Total cytoplasmic RNA 
or poly(A) + RNA was fractionated on 1.2% agarose-form- 
aldehyde gels and transferred to GeneScreenP/ws (NEN). 
Gel-purified cDNA inserts were 32 P-labeled by nick- 
translation and used as probes. When a Northern (RNA) blot 
of GeneScreen/'/Ms was used multiple times for hybridiza- 
tion, the previous probe was removed by placing the mem- 
brane in 10 mM Tris-HCl (pH 7.0)/0.2% sodium dodecyl 
sulfate (SDS) at 85°C for 1 hr. 

High-molecular weight DNA of mouse spleens was pre- 
pared as described (10). Restriction endonuclease digests of 
DNA were electrophoresed in 0.8% agarose gel at 4°C. The 
DNA was denatured and transferred to GeneScreen/ > /ws as 
described by Southern (11). The blot was hybridized with 
32 P-labeled cDNA. 

Screening of cDNA Library and DNA Sequencing. L2 and 
L3 cDNA libraries that had previously been prepared (4) 
were rescreened with cDNA inserts of each of the 14 
T-cell-specific cDNA fragments. Typically 10 positive clones 
were chosen for each species, and the sizes of cDNA inserts 
were determined. The longest cDNA inserts were employed 
for nucleotide sequencing by the dideoxynucleotide chain- 
termination technique (12) employing Sequenase (United 
States Biochemical), with modifications made to accommo- 
date 2'-deoxyadenosine 5'[a- 35 S]thiotriphosphate (13). 

Nucleotide and Protein Sequence Comparison. Full-length 
cDNA and predicted protein sequences were compared with 
the sequences in the National Institutes of Health GenBank 
Genetic Sequence Databank (Release 56), European Molec- 
ular Biology Laboratories (Release 16.0), and National Bio- 
medical Research Foundation (Release 17.0). Predicted pro- 
teins were analyzed by the pepplot program. 

RESULTS 

Table 1 summarizes T-cell cDNA identified from 14-hr Con 
A-stimulated L2 and L3 cDNA libraries. Besides the cDNAs 
listed in the table, granulocyte/macrophage colony stimulat- 
ing factor, interleukin 2, interleukin 3, T-cell antigen receptor 
a- and /3-chain, and c-myc cDNAs were identified by cross- 



Abbreviations: CTL, cytolytic T lymphocyte; HTL, helper T lympho- 
cyte. 

§The sequences reported in this paper are being deposited in the 
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Table 1. Summary of cDNA clones identified 



Full- 


cDNA clone 








isolated 


Specificity 




cDNA 


previously* 


of expression 


Identification 


4-1BB 


L3G29#4 


L2 and L3 


Unknown 




L3G25#4 


L2 and L3 






L3G14#2 






L2G25B 


L2G25#4 


L2 and L3 


Unknown (related 




L2G95#4 




to cDNA PLD78) 




L2G53#3 














L2S35 


L2S35#3 


L2 only 


Preproenkephalin 




L2PBK671 






8-1R 


L2PBK791 


L2 and EL-4 


T-cell-replacing 




L2PBK642 




factor 




L2PBK631 






L3G10 


L3G10#6 


L3 only 


HF gene (serine 




L3G18#3 




esterase) 


NDt 


L3G7#1 


L3 and EL-4 


Unknown 


NDt 


L3G26#1 


L3 and EL-4 


Unknown 



*cDNA clones were isolated independently and described as sepa- 
rate clones in the previous report (4). 
tFulMength version of the two clones was not isolated. 

hybridization of T-cell-enriched cDNAs with the correspond- 
ing full-length cDNA provided by other laboratories. 

Among the 16 unidentified T-cell cDNA clones isolated by 
this screen, two clones represented preproenkephalin that 
was identical to the sequence reported by Zurawski et al. 
(14), three clones were T-cell-replacing factor (15), and two 
clones represented a T-cell serine esterase gene (16). 

In the previous studies (4) we had also isolated an addi- 
tional 13 L3 cDNAs whose specificity could not be assigned 
by RNA blot analysis because of the weak signals. One of 
them (L3-1) was 64% homologous to a T-cell serine esterase 
(17). The sequence has been reported as a new member of the 
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T-cell serine esterase family (18). The other 12 clones have 
not yet been analyzed. 

Four species were from different regions of a cDNA 
referred to as L2G25B (800 base pairs). L2G25B was homol- 
ogous to a human cDNA PLD78 (19) of unknown function. 
Three species (L3G29#4, L3G25#4, and L3G14#2) were 
from different regions of 4-1BB (2400 base pairs). We have 
found ho reports of sequences homologous to 4-1BB. 
L3G7#1 and L3G26#1 were not characterized vigorously 
because they were represented at low levels in T-cell mRNA 
and were not inducible. 

Fig. L4 shows the nucleotide and deduced amino acid 
sequence of the longest open reading frame of L2G25B. The 
open reading frame codes for 92 amino acids including a 
putative signal sequence. The deduced sequence of the first 
23 amino acid residues has characteristics of the signal 
peptide of secretory proteins (23, 39). Therefore the mature 
protein is composed of 69 amino acids with a M T of 7880. No 
potential N-glycosylation sites or transmembrane-like do- 
main exists. The 3 '-untranslated region has repeated occur- 
rences of the sequence AUUUA (40) characteristic of lym- 
phokines and other rapidly degraded mRNA. 

An optimum alignment between L2G25B and PLD78 is 
shown in Fig. IB. The identity of amino acids between the 
two proteins was =80%. The evolutionary conservation of 
these molecules from mouse to human may indicate that they 
play an important role in T-cell function. Recently a family of 
lymphokine genes termed RANTES (regulated upon activa- 
tion, normal T expressed, and presumably secreted) has been 
defined to include PLD78 (20). After the present manuscript 
was prepared, a report appeared describing a macrophage 
inflammatory protein (21); the predicted amino acid sequence 
of this monokine is identical to that of L2G25B. 

The nucleotide sequence of three overlapping cDNA 
clones represented by 4-1BB was determined (Fig. 2). The 
nucleotide sequence of 4-1BB revealed a single long open 
reading frame, beginning with the ATG codon at nucleotide 



G TTCTGCTGAC AAGCTCACCC 



Fig. 1. (A) Nucleotide sequence of 
L2G25B and the deduced amino acid se- 
quence. Nucleotide sequence of the message 
strand is numbered in the 5' and 3' direction. 
The 5' noncoding sequence is indicated by 
negative numbers. Nucleotide residue 1 is 
the first nucleotide of the ATG initiation 
codon. The predicted amino acid sequence is 
shown below. Potential signal peptide is 
underlined, consensus polyadenylylation 
signal is boxed, and stop codon is indicated 
by — . (B) Optimum alignment between 
L2G25B and PLD78-deduced amino acids. 
Homology search revealed that the amino 
acid sequence of L2G25B showed an exten- 
sive homology with a reported human se- 
quence, PLD78 (15) of unknown function, 
whose expression is inducible by phorbol 
12-myristate 13-acetate in human tonsillar 
lymphocytes. The identity of amino acids 
between the two proteins was =80%. *, 
Identical amino acids in these proteins; +, 
chemically similar amino acids found in both 
sequences; A, a potential cleavage site of the 
signal peptide. KWON000043 
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He Phe Lys Gin Pi 
Cys Arg Cys Pro G] 



y Gly Gly Gly Gly Tyr ( 



Fig. 2. Nucleotide sequence and the de- 
duced amino acid sequence of 4-1BB. The 
nucleotides of the message strand are num- 
bered in the 5' and 3' direction, and numbers 
are shown on both sides of the sequence. 
Nucleotide residue 1 is the adenine of the 
initiation codon ATG, and the nucleotides on 
the 5' side of residue 1 are indicated by 
negative numbers. The predicted amino acid 
sequence is shown below the nucleotide 
sequence. Putative signal peptide and the 
potential asparagine-linked glycosylation 
sites are underlined; the potential transmem- 
brane domain is double underlined. The 
potential polyadenylylation signal is boxed. 
N, ambiguous nucleotide sequences in the 3' 
untranslated region; — , stop codon; •, cys- 
teine residues. 



residues 1-3. This reading frame codes for a polypeptide of 
256 amino acids with a M T of 27,587. The assigned ATG is 
preceded by an in-frame termination codon TGA (nucleotide 
residues -12 to -9). The sequence flanking the assigned 
ATG (nucleotide residues -5 to 4) contains eight of nine 
residues identical to the consensus sequence (CCRC- 
CATGG, where R represents guanosine or adenine) de- 
scribed by Kozak (22). The codon specifying the carboxyl- 
terminal leucine is followed by the translational termination 
codon TGA (nucleotide residues 769-771). Clone 4-1BB 
contains an unusually long 3 '-untranslated sequence, which 
did not extend as far as the poly(A) + tail. A potential 
polyadenylylation signal is present at nucleotides 1158-1163 
(Fig. 2, boxed). This signal may sometimes be functional 
because this gene produces at least two different sizes of 
mRNA. Fig. 3 a and b shows RNA blot analysis of Con 
A-stimulated L3 RNA. When the blot was hybridized to an 
L3G25#4 probe that contained sequences to the 3' side of the 
polyadenylylation signal (nucleotides 1284-1557), the probe 
detected one RNA species of =2.4 kilobases (kb). When the 



same blot was hybridized to an L3G14#2 probe that con- 
tained sequences to the 5' side of the polyadenylylation signal 
(nucleotides 661-855) the probe detected two mRNA species 
of =1.5 kb and 2.4 kb. Only one copy of the sequence ATTTA 
that is often present in multiple copies in the 3 '-untranslated 
regions of unstable mRNA was present in 4-1BB, and further 
studies of the stability of its mRNA are needed. Northern blot 
analysis has shown that 4-1BB mRNA is inducible in both L2 
and L3 cells and is present in several CTL clones and 
hybridomas but is absent in the EL-4 T-cell line (B.S.K., 
unpublished results). 

The deduced sequence of the first 23 amino acids of 4-1BB 
cDNA has characteristics of the signal peptide of secretory 
and membrane-associated proteins (23) and fits the -1, -3 
rule (24). We putatively assigned the first 23 amino acids as 
a signal peptide, although the presence of lysine at -4 and 
glutamic acid at -5 are somewhat unusual. Thus, the protein 
backbone of processed 4-1BB protein would be composed of 
233 amino acids with a M r of 25,167. Two potential aspar- 
agine-linked glycosylation signals (25, 26) are located at 

KWON000044 
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Fig. 3. Expression of two different sizes of 4-1BB mRNA. Ten micrograms of poly(A) + mRNA from mouse B-cell line (K46), phorbol 
12-myristate 13-acetate-stimulated EL-4 (EL-4 TPA), and rat natural killer cell line (LGL) and 10 ng of total RNA from unstimulated L3 (L3) 
and Con A-stimulated L3 (L3 ConA) were fractionated on a 1.4% formaldehyde-agarose gel, transferred to GeneScreen/>/(«, and hybridized 
to 32 P-labeled L3G25#4 (a), L3G14#2 (b), and L3G20#3 (c) sequentially. L3G25#4 and L3G14#2 represent cDNA fragments of the 3' side 
and 5' side to boxed AATAAA sequence, respectively. L3G20#3 is an anonymous cDNA from L3 cDNA library and is used to show that each 
lane of the blot contains a similar amount of RNA. Positions of 28S and 18S rRNA markers are each indicated; arrows indicate the specific 
hybridization signals. 



amino acid positions 128 and 138 as underlined in Fig. 2. The 
predicted 4-1BB protein contained an unusually large number 
of cysteines. There are 23 cysteine residues in the putative 
mature protein as marked with dots in Fig. 2. These residues 
are arranged with a spacing reminiscent of that seen in several 
groups of proteins including zinc finger DNA-binding pro- 
teins (27), epidermal growth factor receptor (28), the Dro- 
sophila Notch locus product (29), and certain translation 
factors (30, 31). Amino acids flanking certain of the cysteine 
groups also resemble sequences found in other proteins. For 
example, with a replacement of leucine by isoleucine, there 
is an exact match to seven of eight amino acids of a putative 
zinc finger structure in the yeast elF-2/3 protein (31). These 
residues may represent metal binding sites, but, as discussed 
by others (32), their role in the functions of a protein cannot 
be uniquely predicted. 

Following the cysteine-rich regions of the protein is a 
stretch of amino acids (residues 140-185) in which almost 
30% of the amino acids are serine or threonine; these are 
potential sites for O-linked glycosylation and are reminiscent 
of those seen, for example, in the low density lipoprotein 
receptor (33). There is a stretch of 26 amino acids that 
constitutes a hydrophobic domain toward the carboxyl ter- 
minus of the protein (amino acids 186-211) (Fig. 4), followed 
by a sequence containing several basic residues. This region 
may serve as a membrane-spanning anchor domain. 

Finally, the relatively short carboxyl-terminal putative 
cytoplasmic domain contains both two short runs of three and 
four acidic residues, respectively, and a sequence of five 
glycines followed by a tyrosine. The short stretches of acidic 
residues are also found in the cytoplasmic domain of the low 



density lipoprotein and atrial natriuretic peptide receptors, 
for example (33, 35). 

Southern Blot Analysis. As shown in Fig. 5, a fragment of 
L2G25B and of 4-1BB cDNA each detects a single-restriction 
fragment of =45 kb and 18 kb in C57BL/6 and BALB/c 
DNA, respectively. The data indicated that the genes encod- 
ing the two molecules exist as a single copy in C57BL/6 and 
BALB/c mice. 

DISCUSSION 

We and others have developed protocols for differential 
screening of cDNA libraries by which one can detect a broad 
representation of the mRNA differentially expressed in two 
cell types (1-4). The method was applied to the systematic 
analysis of HTL and CTL gene expression and offered an 
alternative approach to classical protein purification for 
identifying effector molecules and genes. The genes for the 
T-cell antigen receptor, certain T-cell-specific effectors, and 
the probable gene mutated in an X chromosome-linked 
immunodeficiency syndrome of mice (36) were cloned and 
characterized in this fashion, and the set of clones described 
in this and a previous report were obtained before most of the 
individual species represented had been isolated by other 
methods. 

A number of lymphokine clones, including interleukins 2, 
3, and 5 and granulocyte/macrophage colony-stimulating 
factor as well as other T-cell-specific clones such as the serine 
esterases and T-cell receptors, were selected in the relatively 
small libraries (total of 18,000 L2 clones and 10,000 L3 
clones) whose screening we have reported here and else- 
where. Probably the bulk of the most abundant T-cell-specific 



Fig. 4. Hydropathicity profile of the predicted amino acid sequence of 4-1BB. Local hydropathicity values calculated by the method of Kyte 
and Doolittle (34) were plotted versus amino acid residues. Positive values indicate hydrophobic regions, and negative values indicate hydrophilic 
regions. 
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Fig. 5. Southern blot analysis of mouse genomic DNA. Genomic 
DNA from C57BL/6 (lanes 1 and 3) and BALB/c (lanes 2 and 4) was 
digested with EcoRl restriction enzyme, fractionated on a 0.8% 
agarose gel, transferred to GeneScreen/Vus, and hybridized to 
32 P-labeled L2G25B (lanes 1 and 2) and 4-1BB (lanes 3 and 4). 

sequences were detected. Nevertheless, other potent bio- 
logic effectors may well have been missed. For example, L3 
secretes large amounts of interferon -y, but, for unknown 
reasons, we did not recover a cDNA clone for this molecule. 
No clone for any member of the RANTES family other than 
L2G25B was detected, even though mRNA for such genes 
may be among the most abundant mRNA species produced 
in T cells. Similarly, clones for other T-cell activation 
antigens and T-cell-specific membrane channel-forming pro- 
teins were not detected. 

Clones of the cDNA of 4-1BB were obtained from both the 
L2 and L3 cells. The clones corresponding to L2G25B were 
all isolated from L2 cells, and the possibility remains that the 
mRNA sequences detected in L3 corresponded to a different 
member of this gene family. The two species of cDNA 
reported here represent the two abundant, inducible se- 
quences detected in both the HTL and CTL clones used in the 
present study. The induction properties of both clones 
resembled those of other lymphokines in that induction could 
be detected within 30 min after stimulation; they could both 
be induced either by Con A or by antigen stimulation. Unlike 
interleukin 2 receptor mRNA (37), induction of both mRNA 
species was blocked by treatment with cyclosporine A 
(B.S.K., unpublished work). Because 4-1BB protein was 
inducible in both T-cell clones, it may play roles in the T-cell 
activation process and function of the activated T cells less 
specific to a certain T-cell subset than those of the well- 
described lymphokines. As discussed above, 4-1BB has a 
number of properties reminiscent of membrane receptors and 
transmembrane signaling proteins. However, in at least the 
instances of the Drosophila Notch and sevenless loci (29, 38), 
an apparent transmembrane protein seems capable of acting 
directly or indirectly in trans on other cells. 
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